Jieyu Zhang

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Jan 15, 2026

SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning

Dec 15, 2025

MolmoAct: Action Reasoning Models that can Reason in Space

Aug 12, 2025

CoAct-1: Computer-using Agents with Coding as Actions

Aug 05, 2025

Spatial Mental Modeling from Limited Views

Jun 26, 2025

One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory

May 29, 2025

H2R: A Human-to-Robot Data Augmentation for Robot Pre-training from Videos

May 17, 2025

Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems

Apr 30, 2025

Nemotron-Research-Tool-N1: Tool-Using Language Models with Reinforced Reasoning

Apr 25, 2025

Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base

Mar 30, 2025